Kevin Qinghong Lin

Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering

Jun 01, 2026
Via arXiv

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

May 19, 2026
Via arXiv

AI for Auto-Research: Roadmap & User Guide

May 18, 2026
Via arXiv

Checkup2Action: A Multimodal Clinical Check-up Report Dataset for Patient-Oriented Action Card Generation

May 13, 2026
Via arXiv

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Apr 24, 2026
Via arXiv

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Apr 08, 2026
Via arXiv

GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks

Mar 26, 2026
Via arXiv

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Mar 25, 2026
Via arXiv

Code2World: A GUI World Model via Renderable Code Generation

Feb 10, 2026
Via arXiv

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Jan 07, 2026
Via arXiv